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AMENDMENT TO THE CLAIMS 

L (currently amended) A computer-implemented method of extracting information from an 
information source comprising a plurality of d o c uments, comprising: 
accessing strings of text in the information source; and 

comparing the strings of text in the information source with generalized extraction 
patterns and identifying a plurality of strings in the information source that match 
at least one generalized extraction pattern, the generalized extraction patterns 
including words and wildcards, wherein the wildcards denote that at least one 
word in an individual string can be skipped in order to match the individual string 
to an individual generalized extraction pattern; 

extracting a first set of related elements of text pertaining to a top ic from a first string of 
the plurality of strings based on a corresponding set of related el ements pertaining 
to the topic in the at least one generalized extraction pattern, the firs t string being 
associated with a first document in the plurality of documents; 

extracting a s econd set of related elements of text pertaining to the topic from a second 
string of the plur ality of strin gs based on the corresponding set of related elements 

in the at least one generalized extraction, pattern, the second string being 

associated with a second document in the plurality of documents, wherein at least 
one of the related elements of text in the first set of related elements is different 
from, each of the related elements of text in the second set of related elements of 
text; and 

putputtin g the first related set of elements and the second set of related elements . 

2. (original) The computer-implemented method of claim 1 and further comprising processing 
the first related, set of elements and t he second set of related elements to analyze data in the 
inforamtion source ^ traetin^ in the information source that 
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hav e been identified to match, the at-4east---4wo- elements being—based e n at l e ast two- 

ee rresponding elements in a correspon di ng gen^feed--ex4raction- patterns. 

3. (currently amended) The computer-implemented method of claim £-l_wherein for at least one 
of the corresponding elements in each of the generalized extraction patterns, there is at least one 
word positioned between said at least one of the corresponding elements and the wildcards. 

4. (original) The computer-implemented method of claim 1 wherein the wildcards indicate the 
number of words that can be skipped. 

5. (currently amended) A computer-readable medium for extracting information from an 
information source comprising a plurality of d ocuments, comprising: 

a data structure including a set of generalized extraction patterns including words and an 
indication of a position for at least one optional word; and 

an extraction module using the set of generalized extraction patterns to match st rings a 
first string and a second string in the information source with one of the 
generalized extraction patterns, the first string associated with, a, first document in. 
the plurality of documents and the second string ass ociated with a , second 
document in the plurality of documents, extract a first se t of related elements of 
text pertaining to a topic from the first stri ng based on a corresponding set of 
related elements in said one of the genmlized , extraction patte rns and a second set 
of related elements of text pertainin g to the topic from the second string based on 
the corresponding set of related elements in said one of the generalized extraction 
patterns, wherein at least one of the related elements of text in th e first set of 
related elements is different from each of the related elements o f text in the 
second set of related elements of text, and output the first related set of elements 
and the second related set of elements . 
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6. (currently amended) The computer-readable medium of claim 5 wherein- the generalized 
extraction patterns - further metude—a Mteas^^ related to a subjectand further 
comprising a module adapted to process the first set of related elements of text and the second set 
of related elements of text . 

7. (currently amended) The computer-readable medium of claim 6-5wherein for the generalized 
extraction patterns there is at least one word positioned between at least one of the elements and 
the indication. 

8. (original) The computer-readable medium of claim 5 wherein the indication includes a number 
of words that can be skipped during information extraction, 

9-24 (cancelled) 

25. (new) The computer-implemented method of claim 1 wherein each of the elements of the 
first set of related elements of text are different from each of the elements of the second set of 
related elements of text. 

26. (new) The computer-implemented method of claim 1 wherein the corresponding related set 
of elements refer to general elements pertaining to the topic and the first set of related elements 
and the second set of related elements refer to specific text associated with the general elements. 

27. (new) The computer-implemented method of claim 26 wherein the corresponding related set 
of general elements include at least one of a company/product pair, a book title/author pair, an 
inventor/invention information pair and a question/answer pair. 

28. (new) The computer-implemented method of claim 27 wherein the first set of related 
elements and the second set of related elements refer to at least one of a specific company, a 
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specific product, a specific book title, a specific author, a specific inventor, a specific invention, a 
specific question and a specific answer. 

29. (new) The computer-implemented method of claim 1 wherein the plurality of documents 
include at least one of a collection of documents, news articles and a collection of customer 
feedback, 

30, (new) The computer-readable medium of claim 5 wherein each of the elements of the first set 
of related elements of text are different from each of the elements of the second set of related 
elements of text. 

3L (new) The computer- readable medium of claim 5 wherein the corresponding related set of 
elements refer to general elements pertaining to the topic and the first set of related elements and 
the second set of related elements refer to specific text associated with the general elements. 

32. (new) The computer- readable medium of claim 31 wherein the corresponding related set of 
general elements include at least one of a company/product pair, a book title/author pair, an 
inventor/invention information pair and a question/answer pair. 

33. (new) The computer- readable medium of claim 32 wherein the first set of related elements 
and the second set of related elements refer to at least one of a specific company, a specific 
product, a specific book title, a specific author, a specific inventor, a specific invention, a specific 
question and a specific answer. 

34. (new) The computer-readable medium of claim 5 wherein the plurality of documents include 
at least one of a collection of documents, news articles and a collection of customer feedback. 



